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Mapping Table for Unicode Conversion 



Header: Fixed Identification Fields 



Unicode character corresponding to 
source character with encoding 0 



Unicode character corresponding to 
source character with encoding 1 
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source character with encoding k 
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Begin 
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Determine code page or pages required 
for tliis source string's conversion 



300 
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Load mapping table 



Yes 



314 



Determine number of characters 
(NCS) in the source string 




316 



Set count of source characters (K) to 1 



Issue error 
message 



310 



318 



Get K-th source character 



O 



Use binary value of K-th character to index 
into K-th entry in the chosen Mapping table, 
to get K-th Unicode character; store it in "target" field 



Set K to K+1 
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Begin 



402 
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Initialize: set scan pointer to start of nominal-value in source string; 

set SBCS mode 
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i :2 
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Step scan pointer by 1 byte 
(discard the SO character); 
set DBCS mode 
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Source character is the byte 
at the scan pointer position 
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Source character is 2 bytes 
at the scan pointer position 
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Translate source character 

to Unicode 
using SBCS mapping table 



Translate source character 
to Unicode using 
DBCS mapping table 



I 



step scan pointer by 2 bytes 
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Set SBCS mode 



Step scan pointer by 1 byte 
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Translation of source string completed 
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End 
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Title 'DCU — a macro to generate Unicode constants' 
Macro 

&L DCU &A,&Pair=Yes,&CodePage=500 



* 



* 



Expected argument: an apostrophe-delimited string of one or 
more EBCDIC characters, with paired internal apostrophes and 
ampersands. The pairing is preserved in the output string if 
&Pair=Yes, and is not if &Pair=No. 

Initial limitation: max of 63 characters in quoted argument, 
except for paired characters if &Pair=No. 

Declare variables used internally 



LcIC &M(256) Mapping and validation table 

LcIC &V Valid EBCDIC characters 

LcIC &R Result Unicode string 

LclB &P True if '& pairs retained in output 

LcIA &J Counter 

LcIA &N Temp 



Validate macro arguments 



Alf (N'&SysLlst gt 0).V1 Check for argument 

MNote 8,'DCU — No argument.' 

MExit 



.V1 Alf (N'&SysList It 2).V2 Check single argument 
MNote 8,'DCU — More than one argument.' 
MExit 



.V2 Alf (K'&Age3).V3 

MNote 8,'DCU — argument too short, or badly formed.' 
MExit 



.V3 Alf ('&A'(1,1) eq "" and '&A'(K'&A,1) eq "").V4 
MNote 8,'DCU — argument not properly quoted.' 
MExit 

* 

.V4 Alf ('YES' eq (Upper '&Pair')).V5 Check if pairing wanted 
Alf ('NO' eq (Upper '&Pair')).V6 Check if no pairing 
MNote 8,'DCU — invalid value of &&Pair.' 
MExit 

.V5 ANop , 

&P SetB 1 indicate no pairing of '& in output 
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^ 600 

.V6 ANop , 

Alf C&CodePage' eq '500').V7 Check code page 

MNot 8,'DCU — Code Page &Cod Page not supported yet.' 

MExit 

* 

.V7 ANop , 



Arguments validated. Set SBCS and Unicode character sets * 



.VX ANop , Set up mapping table 

&J SetA &J-I-1 

&M(&J) SetC " Initialize to null 

Alf (&J It 256).VX Loop for all 256 code points 

* 

&V SetC '0123456789ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijldmnopqrs* 
tuvwxyz_@#$%&&*()-+=,./: ;"<>"? ' 

The following is the conversion table from CCSID 500 to Unicode 

★ 

&U SetC '3031 323334353637383941 42434445464748494A4B4C4D4E4F5051 5* 
2535455565758595A61 62636465666768696A6B6C6D6E6F7071 72737* 
475767778797A5F4023242526262A28292D2B3D2C2E2F3A3B273C3E2* 

23F20' 

* 

Note: Conditional-assembly string constants require paired 
.* apostrophes and ampersands; ampersands are not reduced to a 
.* single character internally. Thus, the encoding for & appears 
* twice in the &U encoding string above. 

★ 

&J SetA 1 



Build the EBCDIC-to-Unicode mapping table 



.VY ANop 

&C SetC (Double '&V'(&J,1)) Pick character from valid string 

&C SetC 'C&C'" Character in self-defining term 

&N SetA &C+1 Convert to numeric 

&M(&N) SetC '&U'(2*&J-1,2) Put Unicode digits in mapping table 

&J SetA &J->-1 Increment &J 

Alf (&J le K'&V).VY Set up all valid encodings 
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* 



Convert each SBCS argument character to Unicod equivalent 



&J SetA 2 Start after initial apostrophe 

.Z ANop , Head of translation loop 

&C SetC '&A'(&J,1) &J-th character from argument 

Aif C&C ne and '&C' ne •&&'(1,1)).Z1 Is it '& ? 

Alf (&P).Z1 Have '&, is pairing wanted? 

&J SetA &J+1 No pairing, step input by one 
.Z1 ANop , 

&C SetC (Double '&C') Pair '& for self-defining term 

&C SetC 'C'&C'" Change to arithmetic value 

&N SetA &C+1 Convert to numeric 

&C SetC '&M(&N)' Get Unicode mapping 

Alf C&C ne ").Z2 Validly mappable if not null 
MNote 4,'DCU — Unknown character at position &J converte* 
d to blank.' 

&C SetC '20' Unicode blank 

★ 

.Z2 ANop , 

&R SetC '&R.OO&C' Add new character to end 
&J SetA &J+1 

Alf (&J It K'&A).Z Repeat for all internal characters 



* 



Generate the requested Unicode constant 



&L DC X'&R' 
MEnd 
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